Model Selection

Large - scale corpus training

# Large - scale corpus training

Tucano-2b4 is a large - scale language model that is natively pre - trained specifically for Portuguese. It is based on the Transformer architecture and trained on the GigaVerbo dataset with 200 billion tokens.

Large Language Model

Transformers Other

Roberta Base Turkish Uncased

This is a RoBERTa base model based on Turkish. The pre - training data is sourced from Turkish Wikipedia, the Turkish OSCAR corpus, and some news websites.

Large Language Model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase